# LibriSpeech Fine-tuning

Hubert Base Librispeech Demo Colab
Apache-2.0
A speech recognition model fine-tuned from facebook/hubert-large-ls960-ft, trained on the LibriSpeech dataset
Speech Recognition Transformers
H
vishwasgautam
101
0
Wav2vec2 Base Librispeech Demo Colab
Apache-2.0
This model is a speech recognition model fine-tuned on the LibriSpeech dataset based on facebook/wav2vec2-base, achieving a word error rate of 0.3174 on the evaluation set.
Speech Recognition Transformers
W
vishwasgautam
14
0
F5 Ita Test
This is the Italian fine-tuned version of the F5-TTS model, trained on the facebook/multilingual_librispeech dataset, specializing in Italian text-to-speech tasks.
Speech Synthesis Other
F
alien79
98
2
Assignment1 Omar
Apache-2.0
Wav2Vec2 is a self-supervised learning-based speech recognition model, pre-trained and fine-tuned on 960 hours of LibriSpeech audio data, supporting English speech transcription.
Speech Recognition Transformers English
A
Classroom-workshop
28
0
Librispeech 100h Supervised Meta
Apache-2.0
Speech recognition model fine-tuned from Kuray107/librispeech-5h-supervised, trained on 100 hours of LibriSpeech dataset
Speech Recognition Transformers
L
Kuray107
25
0
Wav2vec2 100m Mls German Ft
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the German subset of the multilingual LibriSpeech dataset, based on facebook/wav2vec2-xls-r-100m
Speech Recognition Transformers
W
patrickvonplaten
27
0
Wav2vec2 2 Bart Base
A speech recognition model fine-tuned on the LibriSpeech ASR clean dataset, based on wav2vec2-base and bart-base
Speech Recognition Transformers
W
patrickvonplaten
493
5
Wavlm Base Libri Clean 100
Automatic speech recognition model based on the WavLM architecture, fine-tuned on the LibriSpeech CLEAN dataset (100 hours)
Speech Recognition Transformers
W
anjulRajendraSharma
73
0
Sew D Mid 400k Librispeech Clean 100h Ft
Apache-2.0
This model is an automatic speech recognition model fine-tuned from asapp/sew-d-mid-400k on the LIBRISPEECH_ASR - CLEAN dataset, achieving a word error rate (WER) of 1.0536 on the evaluation set.
Speech Recognition Transformers
S
patrickvonplaten
15
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase